2,524 research outputs found

    Lattice score based data cleaning for phrase-based statistical machine translation

    Get PDF
    Statistical machine translation relies heavily on parallel corpora to train its models for translation tasks. While more and more bilingual corpora are readily available, the quality of the sentence pairs should be taken into consideration. This paper presents a novel lattice score-based data cleaning method to select proper sentence pairs from the ones extracted from a bilingual corpus by the sentence alignment methods. The proposed method is carried out as follows: firstly, an initial phrasebased model is trained on the full sentencealigned corpus; then for each of the sentence pairs in the corpus, word alignments are used to create anchor pairs and sourceside lattices; thirdly, based on the translation model, target-side phrase networks are expanded on the lattices and Viterbi searching is used to find approximated decoding results; finally, BLEU score thresholds are used to filter out the low-score sentence pairs for the data cleaning purpose. Our experiments on the FBIS corpus showed improvements of BLEU score from 23.78 to 24.02 in Chinese-English

    Hürthle cell carcinoma: current perspectives.

    Get PDF
    Hürthle cell carcinoma (HCC) can present either as a minimally invasive or as a widely invasive tumor. HCC generally has a more aggressive clinical behavior compared with the other differentiated thyroid cancers, and it is associated with a higher rate of distant metastases. Minimally invasive HCC demonstrates much less aggressive behavior; lesions <4 cm can be treated with thyroid lobectomy alone, and without radioactive iodine (RAI). HCC has been observed to be less iodine-avid compared with other differentiated thyroid cancers; however, recent data have demonstrated improved survival with RAI use in patients with HCC >2 cm and those with nodal and distant metastases. Patients with localized iodine-resistant disease who are not candidates for a wait-and-watch approach can be treated with localized therapies. Systemic therapy is reserved for patients with progressive, widely metastatic HCC

    Regulation of cAMP responses by the G12/13 pathway converges on adenylyl cyclase VII

    Get PDF
    Regulation of intracellular cyclic adenosine 3’, 5’-monophosphate (cAMP) by multiple pathways enables differential function of this ubiquitous second messenger in a context dependent manner. Modulation of Gs-stimulated intracellular cAMP has long been known to be modulated by the Gi and Gq/Ca2+ pathways. Recently, the G13 pathway was also shown to facilitate cAMP responses in murine macrophage cells. We report here that this synergistic regulation of cAMP synthesis by the Gs and G13 pathways is mediated by a specific isoform of adenylyl cyclase, AC7. Furthermore, this signaling paradigm exists in several hematopoietic lineages and can be recapitulated by exogenous expression of AC7 in HEK 293 cells. Mechanistic characterization of this synergistic interaction indicates that it occurs downstream of receptor activation and it can be mediated by the alpha subunit of either G12 or G13. Our results demonstrate that AC7 is a specific downstream effector of the G12/13 pathway

    Understanding lamin proteins and their roles in aging and cardiovascular diseases

    Get PDF
    The occurrence of cardiovascular diseases increases with age independent of other risk factors, and the percentage of senescent cells is significantly elevated in vascular cells at atherosclerotic sites. Patients with accelerated aging syndromes caused by mutant lamin A protein, a structural component in nuclear lamina, also share many similarities with normal aged people, including the propensity to develop atherosclerosis. Recent studies have revealed the accumulation of prelamin A in normal aged vascular cells, and that lamin A participated as a mechanosensitive molecule in regulating various cellular events. These findings suggest that the ectopic expression of mutant lamin A or lamin A precursor (prelamin A) not only causes defects in cell mechanics, but it also disturbs stress-induced mechanotransduction pathways involving lamin A, both of which may contribute to vascular dysregulation. This review summarizes the current understanding of how lamin proteins are involved in vascular cell during aging, with a particular focus on the effect of mechanical stresses from blood flow on nuclear lamina of endothelial cells. Related studies are clarifying the role of lamin A in the progression of atherosclerosis, which will aid in the development of potential therapies for those suffering from lamin A-associated accelerated aging syndromes

    Retweet-BERT: Political Leaning Detection Using Language Features and Information Diffusion on Social Networks

    Full text link
    Estimating the political leanings of social media users is a challenging and ever more pressing problem given the increase in social media consumption. We introduce Retweet-BERT, a simple and scalable model to estimate the political leanings of Twitter users. Retweet-BERT leverages the retweet network structure and the language used in users' profile descriptions. Our assumptions stem from patterns of networks and linguistics homophily among people who share similar ideologies. Retweet-BERT demonstrates competitive performance against other state-of-the-art baselines, achieving 96%-97% macro-F1 on two recent Twitter datasets (a COVID-19 dataset and a 2020 United States presidential elections dataset). We also perform manual validation to validate the performance of Retweet-BERT on users not in the training data. Finally, in a case study of COVID-19, we illustrate the presence of political echo chambers on Twitter and show that it exists primarily among right-leaning users. Our code is open-sourced and our data is publicly available.Comment: 11 pages, 3 figures, 4 tables. arXiv admin note: text overlap with arXiv:2103.1097

    Geolocated Social Media Posts are Happier: Understanding the Characteristics of Check-in Posts on Twitter

    Full text link
    The increasing prevalence of location-sharing features on social media has enabled researchers to ground computational social science research using geolocated data, affording opportunities to study human mobility, the impact of real-world events, and more. This paper analyzes what crucially separates posts with geotags from those without. We find that users who share location are not representative of the social media user population at large, jeopardizing the generalizability of research that uses only geolocated data.We consider three aspects: affect -- sentiment and emotions, content -- textual and non-textual, and audience engagement. By comparing a dataset of 1.3 million geotagged tweets with a random dataset of the same size, we show that geotagged posts on Twitter exhibit significantly more positivity, are often about joyous and special events such as weddings or graduations, convey more collectivism rather than individualism, and contain more additional features such as hashtags or objects in images, but at the same time generate substantially less engagement. These findings suggest there exist significant differences in the messages conveyed in geotagged posts. Our research carries important implications for future research utilizing geolocation social media data.Comment: 11 pages, 10 figures, 2 table
    corecore